Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 11286 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 7 |
|---|---|
| DateTime | 1 |
| Categorical | 8 |
| Text | 2 |
Alerts
| Alert | Type |
|---|---|
| country has constant value "United States" | Constant |
| total_transaction_revenue is highly overall correlated with products | High correlation |
| total_hits is highly overall correlated with total_pageviews and 2 other fields | High correlation |
| total_pageviews is highly overall correlated with total_hits and 2 other fields | High correlation |
| total_time_on_site is highly overall correlated with total_hits and 1 other field | High correlation |
| products is highly overall correlated with total_transaction_revenue and 2 other fields | High correlation |
| channel_grouping is highly overall correlated with traffic_source | High correlation |
| traffic_source is highly overall correlated with channel_grouping | High correlation |
| kmeans_cluster is highly overall correlated with agg_cluster and 1 other field | High correlation |
| agg_cluster is highly overall correlated with kmeans_cluster and 1 other field | High correlation |
| device_category is highly overall correlated with kmeans_cluster and 1 other field | High correlation |
| browser is highly imbalanced (81.1%) | Imbalance |
| traffic_source is highly imbalanced (79.4%) | Imbalance |
| kmeans_cluster is highly imbalanced (63.3%) | Imbalance |
| dbscan_cluster is highly imbalanced (74.7%) | Imbalance |
| agg_cluster is highly imbalanced (52.0%) | Imbalance |
| device_category is highly imbalanced (69.7%) | Imbalance |
| total_transaction_revenue is highly skewed (γ₁ = 24.78989535) | Skewed |
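The imbalance percentages reported in these alerts are consistent with one minus the normalized Shannon entropy of the class counts. The sketch below works under that assumption and is not the profiler's own code; the function name `imbalance_score` is hypothetical, and the counts are the four kmeans_cluster class counts listed in that variable's Common Values table.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is treated as score 0 here
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# kmeans_cluster class counts from this report (clusters 0, 3, 1, 2)
kmeans_counts = [9818, 814, 494, 160]
score = imbalance_score(kmeans_counts)
print(f"{score:.1%}")  # prints 63.3%

# A perfectly balanced variable scores 0 under this definition
balanced = imbalance_score([25, 25, 25, 25])
```

A score near 1 means almost all rows fall into one class; a score of 0 means the classes are equally sized.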
Reproduction
| Analysis started | 2024-10-01 19:34:24.841072 |
|---|---|
| Analysis finished | 2024-10-01 19:34:30.873088 |
| Duration | 6.03 seconds |
| Software version | ydata-profiling v4.6.1 |
| Download configuration | config.json |
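As a quick consistency check, the reported duration can be recomputed from the two timestamps in the table above. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table
started = datetime.fromisoformat("2024-10-01 19:34:24.841072")
finished = datetime.fromisoformat("2024-10-01 19:34:30.873088")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # prints 6.03 seconds
```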
visitor_id
Real number (ℝ)
| Distinct | 9502 |
|---|---|
| Distinct (%) | 84.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5075969 × 10¹⁸ |
| Minimum | 2.1313114 × 10¹⁴ |
| Maximum | 9.998996 × 10¹⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 88.3 KiB |
Quantile statistics
| Minimum | 2.1313114 × 10¹⁴ |
|---|---|
| 5-th percentile | 1.9855432 × 10¹⁷ |
| Q1 | 1.646748 × 10¹⁸ |
| median | 4.3876095 × 10¹⁸ |
| Q3 | 7.1845698 × 10¹⁸ |
| 95-th percentile | 9.4477648 × 10¹⁸ |
| Maximum | 9.998996 × 10¹⁸ |
| Range | 9.9987829 × 10¹⁸ |
| Interquartile range (IQR) | 5.5378218 × 10¹⁸ |
Descriptive statistics
| Standard deviation | 3.0640877 × 10¹⁸ |
|---|---|
| Coefficient of variation (CV) | 0.6797608 |
| Kurtosis | -1.2702596 |
| Mean | 4.5075969 × 10¹⁸ |
| Median Absolute Deviation (MAD) | 2.7608994 × 10¹⁸ |
| Skewness | 0.13240198 |
| Sum | 5.0872739 × 10²² |
| Variance | 9.3886334 × 10³⁶ |
| Monotonicity | Increasing |
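Although visitor_id is profiled as a real number, its values read as identifiers, so the storage type deserves a second look. The reported maximum lies above the signed 64-bit integer range, and a float64 cannot represent every integer above 2^53. The sketch below illustrates both points; `hypothetical_id` is an invented value rather than one taken from the dataset, and the report does not state how the column is actually stored.

```python
INT64_MAX = 2**63 - 1        # 9223372036854775807
FLOAT64_EXACT_LIMIT = 2**53  # above this, not every integer is representable

# Maximum from the table above, rounded as displayed in the report
reported_max = 9.998996e18
exceeds_int64 = reported_max > INT64_MAX

# Invented identifier of similar magnitude, used only for illustration
hypothetical_id = 7_813_149_961_000_000_001
round_tripped = int(float(hypothetical_id))
survives_float64 = round_tripped == hypothetical_id

print(exceeds_int64, survives_float64)  # prints True False
```

If the identifiers were ever cast to floating point, distinct IDs could collapse into the same value, which would affect the distinct counts.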
| Value | Count | Frequency (%) |
| 7.813149961 × 10¹⁸ | 35 | 0.3% |
| 6.760732402 × 10¹⁸ | 25 | 0.2% |
| 1.957458976 × 10¹⁸ | 22 | 0.2% |
| 4.984366501 × 10¹⁸ | 17 | 0.2% |
| 2.4025272 × 10¹⁸ | 15 | 0.1% |
| 6.089151977 × 10¹⁷ | 14 | 0.1% |
| 9.662800125 × 10¹⁸ | 12 | 0.1% |
| 7.311242886 × 10¹⁸ | 12 | 0.1% |
| 5.526675926 × 10¹⁸ | 12 | 0.1% |
| 7.71301243 × 10¹⁸ | 11 | 0.1% |
| Other values (9492) | 11111 | 98.4% |
| Value | Count | Frequency (%) |
| 2.131311426 × 10¹⁴ | 1 | < 0.1% |
| 4.353240613 × 10¹⁴ | 1 | < 0.1% |
| 5.62678147 × 10¹⁴ | 2 | < 0.1% |
| 5.85708896 × 10¹⁴ | 1 | < 0.1% |
| 8.528012638 × 10¹⁴ | 1 | < 0.1% |
| 1.123528056 × 10¹⁵ | 1 | < 0.1% |
| 1.905118576 × 10¹⁵ | 1 | < 0.1% |
| 2.527528149 × 10¹⁵ | 1 | < 0.1% |
| 2.709834583 × 10¹⁵ | 1 | < 0.1% |
| 2.838359589 × 10¹⁵ | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.998996003 × 10¹⁸ | 1 | < 0.1% |
| 9.998597322 × 10¹⁸ | 1 | < 0.1% |
| 9.997409247 × 10¹⁸ | 1 | < 0.1% |
| 9.994767073 × 10¹⁸ | 1 | < 0.1% |
| 9.991633376 × 10¹⁸ | 1 | < 0.1% |
| 9.990797197 × 10¹⁸ | 1 | < 0.1% |
| 9.990183617 × 10¹⁸ | 2 | < 0.1% |
| 9.989795984 × 10¹⁸ | 1 | < 0.1% |
| 9.989256027 × 10¹⁸ | 1 | < 0.1% |
| 9.988700587 × 10¹⁸ | 1 | < 0.1% |
visit_date
Date
| Distinct | 366 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| Minimum | 2016-08-01 00:00:00 |
| Maximum | 2017-08-02 00:00:00 |
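The distinct count can be compared with the calendar span between the minimum and maximum dates to see whether every day is represented. A minimal sketch, assuming the span is counted inclusively; the variable names are illustrative.

```python
from datetime import date

first_day = date(2016, 8, 1)   # Minimum from the table above
last_day = date(2017, 8, 2)    # Maximum from the table above
distinct_dates = 366           # Distinct from the table above

calendar_days = (last_day - first_day).days + 1  # inclusive span
missing_days = calendar_days - distinct_dates

print(calendar_days, missing_days)  # prints 367 1
```

Under these figures, exactly one calendar date in the range has no observation.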
total_transaction_revenue
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 6059 |
|---|---|
| Distinct (%) | 53.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.469939 × 10⁸ |
| Minimum | 1200000 |
| Maximum | 2.395256 × 10¹⁰ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 88.3 KiB |
Quantile statistics
| Minimum | 1200000 |
|---|---|
| 5-th percentile | 14990000 |
| Q1 | 29882500 |
| median | 54755000 |
| Q3 | 1.139325 × 10⁸ |
| 95-th percentile | 5.227675 × 10⁸ |
| Maximum | 2.395256 × 10¹⁰ |
| Range | 2.395136 × 10¹⁰ |
| Interquartile range (IQR) | 84050000 |
Descriptive statistics
| Standard deviation | 5.6179179 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 3.8218713 |
| Kurtosis | 823.04591 |
| Mean | 1.469939 × 10⁸ |
| Median Absolute Deviation (MAD) | 30775000 |
| Skewness | 24.789895 |
| Sum | 1.6589732 × 10¹² |
| Variance | 3.1561001 × 10¹⁷ |
| Monotonicity | Not monotonic |
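Several of the statistics above are derived from others, which makes them easy to cross-check. The sketch below recomputes range, IQR, coefficient of variation, and variance for total_transaction_revenue from the displayed (rounded) figures; the variable names are illustrative, and the comparison uses a relative tolerance because the report rounds to about eight significant digits.

```python
import math

# Figures as displayed in the report for total_transaction_revenue
minimum, maximum = 1_200_000, 2.395256e10
q1, q3 = 29_882_500, 1.139325e8
mean, std = 1.469939e8, 5.6179179e8

# (recomputed value, value reported in the tables)
checks = {
    "range": (maximum - minimum, 2.395136e10),
    "iqr": (q3 - q1, 84_050_000),
    "cv": (std / mean, 3.8218713),
    "variance": (std ** 2, 3.1561001e17),
}

results = {
    name: math.isclose(computed, reported, rel_tol=1e-6)
    for name, (computed, reported) in checks.items()
}
for name, ok in results.items():
    print(name, ok)  # each line prints True
```

A coefficient of variation near 3.8 together with a median of 54755000 against a mean of about 1.47 × 10⁸ is in line with the strong right skew flagged for this variable.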
| Value | Count | Frequency (%) |
| 23990000 | 93 | 0.8% |
| 24990000 | 86 | 0.8% |
| 25990000 | 82 | 0.7% |
| 22990000 | 80 | 0.7% |
| 21990000 | 80 | 0.7% |
| 19990000 | 68 | 0.6% |
| 20990000 | 59 | 0.5% |
| 18990000 | 57 | 0.5% |
| 17990000 | 56 | 0.5% |
| 26990000 | 49 | 0.4% |
| Other values (6049) | 10576 |
| Value | Count | Frequency (%) |
| 1200000 | 1 | < 0.1% |
| 2040000 | 1 | < 0.1% |
| 2200000 | 1 | < 0.1% |
| 2490000 | 1 | < 0.1% |
| 2500000 | 1 | < 0.1% |
| 2990000 | 7 | |
| 3010000 | 1 | < 0.1% |
| 3200000 | 2 | < 0.1% |
| 3400000 | 1 | < 0.1% |
| 3500000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2.395256 × 10¹⁰ | 1 | < 0.1% |
| 2.31365 × 10¹⁰ | 1 | < 0.1% |
| 1.78595 × 10¹⁰ | 1 | < 0.1% |
| 1.603275 × 10¹⁰ | 1 | < 0.1% |
| 1.563461 × 10¹⁰ | 1 | < 0.1% |
| 1.466012 × 10¹⁰ | 1 | < 0.1% |
| 1.151181 × 10¹⁰ | 1 | < 0.1% |
| 1.090777 × 10¹⁰ | 1 | < 0.1% |
| 1.059514 × 10¹⁰ | 1 | < 0.1% |
| 8680830000 | 1 | < 0.1% |
channel_grouping
Categorical
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| Referral | 5354 |
| Organic Search | 3182 |
| Direct | 2021 |
| Paid Search | 472 |
| Display | 150 |
| Other values (3) | 107 |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 9.4300018 |
| Min length | 6 |
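The minimum, maximum, and mean length can be reproduced from the value counts listed in this variable's Common Values table. A minimal sketch, assuming length means the number of characters in each value; the dictionary name `value_counts` is illustrative.

```python
# channel_grouping values and their counts, from the Common Values table
value_counts = {
    "Referral": 5354,
    "Organic Search": 3182,
    "Direct": 2021,
    "Paid Search": 472,
    "Display": 150,
    "Social": 97,
    "Affiliates": 9,
    "(Other)": 1,
}

n_rows = sum(value_counts.values())
total_chars = sum(len(value) * count for value, count in value_counts.items())
mean_length = total_chars / n_rows
min_length = min(len(value) for value in value_counts)
max_length = max(len(value) for value in value_counts)

print(n_rows, total_chars)                           # prints 11286 106427
print(min_length, max_length, round(mean_length, 7)) # prints 6 14 9.4300018
```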
Characters and Unicode
| Total characters | 106427 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Direct |
|---|---|
| 2nd row | Referral |
| 3rd row | Referral |
| 4th row | Referral |
| 5th row | Referral |
Common Values
| Value | Count | Frequency (%) |
| Referral | 5354 | 47.4% |
| Organic Search | 3182 | 28.2% |
| Direct | 2021 | 17.9% |
| Paid Search | 472 | 4.2% |
| Display | 150 | 1.3% |
| Social | 97 | 0.9% |
| Affiliates | 9 | 0.1% |
| (Other) | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| referral | 5354 | 35.8% |
| search | 3654 | 24.5% |
| organic | 3182 | 21.3% |
| direct | 2021 | 13.5% |
| paid | 472 | 3.2% |
| display | 150 | 1.0% |
| social | 97 | 0.6% |
| affiliates | 9 | 0.1% |
| other | 1 | < 0.1% |
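The word counts above can be reproduced from the value counts by lowercasing each value, splitting on whitespace, and stripping surrounding punctuation. That tokenization is an assumption inferred from the table (for example "(Other)" appearing as "other"), not a documented rule of the profiler; the names below are illustrative.

```python
import string
from collections import Counter

# channel_grouping values and their counts, from the Common Values table
value_counts = {
    "Referral": 5354,
    "Organic Search": 3182,
    "Direct": 2021,
    "Paid Search": 472,
    "Display": 150,
    "Social": 97,
    "Affiliates": 9,
    "(Other)": 1,
}

word_counts = Counter()
for value, count in value_counts.items():
    for token in value.lower().split():
        # strip leading/trailing punctuation only, so inner hyphens survive
        word_counts[token.strip(string.punctuation)] += count

total_words = sum(word_counts.values())
direct_share = word_counts["direct"] / total_words

print(word_counts.most_common(3))
# prints [('referral', 5354), ('search', 3654), ('organic', 3182)]
print(f"{direct_share:.1%}")  # prints 13.5%
```

Because percentages here are shares of all words rather than of rows, "direct" drops from 17.9% of rows to 13.5% of words.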
Most occurring characters
| Value | Count | Frequency (%) |
| r | 19566 | |
| e | 16393 | |
| a | 12918 | |
| c | 8954 | |
| i | 5940 | 5.6% |
| l | 5610 | 5.3% |
| f | 5372 | 5.0% |
| R | 5354 | 5.0% |
| S | 3751 | 3.5% |
| h | 3655 | 3.4% |
| Other values (15) | 18914 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 87831 | |
| Uppercase Letter | 14940 | 14.0% |
| Space Separator | 3654 | 3.4% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
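The category names in this table correspond to Unicode general categories. The sketch below recomputes the counts with the standard library's `unicodedata.category`, weighting each character by the number of rows that contain its value; the mapping `CATEGORY_NAMES` covers only the codes that occur here and is written for illustration.

```python
import unicodedata
from collections import Counter

# channel_grouping values and their counts, from the Common Values table
value_counts = {
    "Referral": 5354,
    "Organic Search": 3182,
    "Direct": 2021,
    "Paid Search": 472,
    "Display": 150,
    "Social": 97,
    "Affiliates": 9,
    "(Other)": 1,
}

# Two-letter Unicode general category codes mapped to the report's labels
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

category_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        code = unicodedata.category(ch)
        category_counts[CATEGORY_NAMES.get(code, code)] += count

for name, count in category_counts.most_common():
    print(name, count)
```

The totals sum to 106427, the figure given as total characters for this variable.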
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 19566 | |
| e | 16393 | |
| a | 12918 | |
| c | 8954 | |
| i | 5940 | 6.8% |
| l | 5610 | 6.4% |
| f | 5372 | 6.1% |
| h | 3655 | 4.2% |
| n | 3182 | 3.6% |
| g | 3182 | 3.6% |
| Other values (6) | 3059 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 5354 | |
| S | 3751 | |
| O | 3183 | |
| D | 2171 | |
| P | 472 | 3.2% |
| A | 9 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3654 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 102771 | |
| Common | 3656 | 3.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 19566 | |
| e | 16393 | |
| a | 12918 | |
| c | 8954 | |
| i | 5940 | 5.8% |
| l | 5610 | 5.5% |
| f | 5372 | 5.2% |
| R | 5354 | 5.2% |
| S | 3751 | 3.6% |
| h | 3655 | 3.6% |
| Other values (12) | 15258 |
Common
| Value | Count | Frequency (%) |
| (space) | 3654 | 99.9% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 106427 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 19566 | |
| e | 16393 | |
| a | 12918 | |
| c | 8954 | |
| i | 5940 | 5.6% |
| l | 5610 | 5.3% |
| f | 5372 | 5.0% |
| R | 5354 | 5.0% |
| S | 3751 | 3.5% |
| h | 3655 | 3.4% |
| Other values (15) | 18914 |
browser
Categorical
IMBALANCE 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| Chrome | 10204 |
| Safari | 725 |
| Firefox | 179 |
| Internet Explorer | 100 |
| Edge | 53 |
| Other values (4) | 25 |
Length
| Max length | 17 |
|---|---|
| Median length | 6 |
| Mean length | 6.1181995 |
| Min length | 4 |
Characters and Unicode
| Total characters | 69050 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Chrome |
|---|---|
| 2nd row | Chrome |
| 3rd row | Chrome |
| 4th row | Chrome |
| 5th row | Chrome |
Common Values
| Value | Count | Frequency (%) |
| Chrome | 10204 | 90.4% |
| Safari | 725 | 6.4% |
| Firefox | 179 | 1.6% |
| Internet Explorer | 100 | 0.9% |
| Edge | 53 | 0.5% |
| Safari (in-app) | 12 | 0.1% |
| Opera | 6 | 0.1% |
| Android Webview | 6 | 0.1% |
| Amazon Silk | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| chrome | 10204 | |
| safari | 737 | 6.5% |
| firefox | 179 | 1.6% |
| internet | 100 | 0.9% |
| explorer | 100 | 0.9% |
| edge | 53 | 0.5% |
| in-app | 12 | 0.1% |
| opera | 6 | 0.1% |
| android | 6 | 0.1% |
| webview | 6 | 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 11432 | |
| e | 10754 | |
| o | 10490 | |
| m | 10205 | |
| C | 10204 | |
| h | 10204 | |
| a | 1493 | 2.2% |
| i | 941 | 1.4% |
| f | 916 | 1.3% |
| S | 738 | 1.1% |
| Other values (22) | 1673 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57502 | |
| Uppercase Letter | 11393 | 16.5% |
| Space Separator | 119 | 0.2% |
| Dash Punctuation | 12 | < 0.1% |
| Close Punctuation | 12 | < 0.1% |
| Open Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 11432 | |
| e | 10754 | |
| o | 10490 | |
| m | 10205 | |
| h | 10204 | |
| a | 1493 | 2.6% |
| i | 941 | 1.6% |
| f | 916 | 1.6% |
| x | 279 | 0.5% |
| n | 219 | 0.4% |
| Other values (10) | 569 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 10204 | |
| S | 738 | 6.5% |
| F | 179 | 1.6% |
| E | 153 | 1.3% |
| I | 100 | 0.9% |
| A | 7 | 0.1% |
| O | 6 | 0.1% |
| W | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 119 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 68895 | |
| Common | 155 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 11432 | |
| e | 10754 | |
| o | 10490 | |
| m | 10205 | |
| C | 10204 | |
| h | 10204 | |
| a | 1493 | 2.2% |
| i | 941 | 1.4% |
| f | 916 | 1.3% |
| S | 738 | 1.1% |
| Other values (18) | 1518 | 2.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 119 | 76.8% |
| - | 12 | 7.7% |
| ) | 12 | 7.7% |
| ( | 12 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69050 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 11432 | |
| e | 10754 | |
| o | 10490 | |
| m | 10205 | |
| C | 10204 | |
| h | 10204 | |
| a | 1493 | 2.2% |
| i | 941 | 1.4% |
| f | 916 | 1.3% |
| S | 738 | 1.1% |
| Other values (22) | 1673 | 2.4% |
traffic_source
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| (direct) | |
|---|---|
| dfa | 130 |
| mail.google.com | 60 |
| sites.google.com | 43 |
| Other values (35) | 231 |
Length
| Max length | 25 |
|---|---|
| Median length | 8 |
| Mean length | 7.7078682 |
| Min length | 3 |
Characters and Unicode
| Total characters | 86991 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 15 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | (direct) |
|---|---|
| 2nd row | (direct) |
| 3rd row | (direct) |
| 4th row | (direct) |
| 5th row | (direct) |
Common Values
| Value | Count | Frequency (%) |
| (direct) | 8648 | 76.6% |
|  | 2174 | 19.3% |
| dfa | 130 | 1.2% |
| mail.google.com | 60 | 0.5% |
| sites.google.com | 43 | 0.4% |
| dealspotr.com | 39 | 0.3% |
| groups.google.com | 37 | 0.3% |
| yahoo | 22 | 0.2% |
| bing | 20 | 0.2% |
| facebook.com | 14 | 0.1% |
| Other values (30) | 99 | 0.9% |
Words
| Value | Count | Frequency (%) |
| direct | 8648 | 76.6% |
|  | 2174 | 19.3% |
| dfa | 130 | 1.2% |
| mail.google.com | 60 | 0.5% |
| sites.google.com | 43 | 0.4% |
| dealspotr.com | 39 | 0.3% |
| groups.google.com | 37 | 0.3% |
| yahoo | 22 | 0.2% |
| bing | 20 | 0.2% |
| facebook.com | 14 | 0.1% |
| Other values (30) | 99 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 11146 | |
| c | 8982 | |
| d | 8837 | |
| i | 8805 | |
| t | 8779 | |
| r | 8765 | |
| ( | 8648 | |
| ) | 8648 | |
| o | 5188 | |
| g | 4732 | |
| Other values (20) | 4461 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 69211 | |
| Open Punctuation | 8648 | 9.9% |
| Close Punctuation | 8648 | 9.9% |
| Other Punctuation | 473 | 0.5% |
| Uppercase Letter | 9 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11146 | |
| c | 8982 | |
| d | 8837 | |
| i | 8805 | |
| t | 8779 | |
| r | 8765 | |
| o | 5188 | |
| g | 4732 | |
| l | 2476 | 3.6% |
| m | 355 | 0.5% |
| Other values (14) | 1146 | 1.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8648 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8648 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 473 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 9 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 69220 | |
| Common | 17771 | 20.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11146 | |
| c | 8982 | |
| d | 8837 | |
| i | 8805 | |
| t | 8779 | |
| r | 8765 | |
| o | 5188 | |
| g | 4732 | |
| l | 2476 | 3.6% |
| m | 355 | 0.5% |
| Other values (15) | 1155 | 1.7% |
Common
| Value | Count | Frequency (%) |
| ( | 8648 | |
| ) | 8648 | |
| . | 473 | 2.7% |
| - | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86991 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 11146 | |
| c | 8982 | |
| d | 8837 | |
| i | 8805 | |
| t | 8779 | |
| r | 8765 | |
| ( | 8648 | |
| ) | 8648 | |
| o | 5188 | |
| g | 4732 | |
| Other values (20) | 4461 |
country
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| United States | 11286 |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 146718 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
Common Values
| Value | Count | Frequency (%) |
| United States | 11286 | 100.0% |
Words
| Value | Count | Frequency (%) |
| united | 11286 | 50.0% |
| states | 11286 | 50.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 33858 | |
| e | 22572 | |
| U | 11286 | 7.7% |
| n | 11286 | 7.7% |
| i | 11286 | 7.7% |
| d | 11286 | 7.7% |
| (space) | 11286 | 7.7% |
| S | 11286 | 7.7% |
| a | 11286 | 7.7% |
| s | 11286 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 112860 | |
| Uppercase Letter | 22572 | 15.4% |
| Space Separator | 11286 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 33858 | |
| e | 22572 | |
| n | 11286 | 10.0% |
| i | 11286 | 10.0% |
| d | 11286 | 10.0% |
| a | 11286 | 10.0% |
| s | 11286 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 11286 | |
| S | 11286 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11286 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 135432 | |
| Common | 11286 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 33858 | |
| e | 22572 | |
| U | 11286 | 8.3% |
| n | 11286 | 8.3% |
| i | 11286 | 8.3% |
| d | 11286 | 8.3% |
| S | 11286 | 8.3% |
| a | 11286 | 8.3% |
| s | 11286 | 8.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 11286 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 146718 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 33858 | |
| e | 22572 | |
| U | 11286 | 7.7% |
| n | 11286 | 7.7% |
| i | 11286 | 7.7% |
| d | 11286 | 7.7% |
| (space) | 11286 | 7.7% |
| S | 11286 | 7.7% |
| a | 11286 | 7.7% |
| s | 11286 | 7.7% |
cities
Text
| Distinct | 2170 |
|---|---|
| Distinct (%) | 19.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 9.2098175 |
| Min length | 3 |
Characters and Unicode
| Total characters | 103942 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 593 |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | Spokane |
|---|---|
| 2nd row | Pooler |
| 3rd row | Chesterfield |
| 4th row | Chesterfield |
| 5th row | Murfreesboro |
| Value | Count | Frequency (%) |
| city | 321 | 2.1% |
| west | 206 | 1.4% |
| falls | 201 | 1.3% |
| park | 181 | 1.2% |
| south | 162 | 1.1% |
| north | 151 | 1.0% |
| valley | 121 | 0.8% |
| eagle | 121 | 0.8% |
| east | 115 | 0.8% |
| saint | 98 | 0.7% |
| Other values (1978) | 13350 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9578 | 9.2% |
| a | 9176 | 8.8% |
| n | 7636 | 7.3% |
| o | 7273 | 7.0% |
| l | 7009 | 6.7% |
| r | 6659 | 6.4% |
| i | 6115 | 5.9% |
| t | 5752 | 5.5% |
| s | 4696 | 4.5% |
| (space) | 3741 | 3.6% |
| Other values (53) | 36307 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 84902 | |
| Uppercase Letter | 15140 | 14.6% |
| Space Separator | 3741 | 3.6% |
| Dash Punctuation | 90 | 0.1% |
| Other Punctuation | 60 | 0.1% |
| Decimal Number | 3 | < 0.1% |
| Initial Punctuation | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9578 | |
| a | 9176 | |
| n | 7636 | |
| o | 7273 | 8.6% |
| l | 7009 | 8.3% |
| r | 6659 | 7.8% |
| i | 6115 | 7.2% |
| t | 5752 | 6.8% |
| s | 4696 | 5.5% |
| d | 2819 | 3.3% |
| Other values (19) | 18189 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1590 | 10.5% |
| S | 1380 | 9.1% |
| M | 1271 | 8.4% |
| B | 1227 | 8.1% |
| P | 948 | 6.3% |
| W | 898 | 5.9% |
| L | 850 | 5.6% |
| F | 834 | 5.5% |
| H | 751 | 5.0% |
| A | 702 | 4.6% |
| Other values (16) | 4689 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 42 | |
| . | 18 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3741 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 90 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 100042 | |
| Common | 3900 | 3.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9578 | 9.6% |
| a | 9176 | 9.2% |
| n | 7636 | 7.6% |
| o | 7273 | 7.3% |
| l | 7009 | 7.0% |
| r | 6659 | 6.7% |
| i | 6115 | 6.1% |
| t | 5752 | 5.7% |
| s | 4696 | 4.7% |
| d | 2819 | 2.8% |
| Other values (45) | 33329 |
Common
| Value | Count | Frequency (%) |
| (space) | 3741 | 95.9% |
| - | 90 | 2.3% |
| ' | 42 | 1.1% |
| . | 18 | 0.5% |
| 1 | 3 | 0.1% |
| ‘ | 2 | 0.1% |
| ( | 2 | 0.1% |
| ) | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 103932 | |
| None | 8 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 9578 | 9.2% |
| a | 9176 | 8.8% |
| n | 7636 | 7.3% |
| o | 7273 | 7.0% |
| l | 7009 | 6.7% |
| r | 6659 | 6.4% |
| i | 6115 | 5.9% |
| t | 5752 | 5.5% |
| s | 4696 | 4.5% |
| (space) | 3741 | 3.6% |
| Other values (49) | 36297 |
None
| Value | Count | Frequency (%) |
| ñ | 5 | |
| ā | 2 | 25.0% |
| ī | 1 | 12.5% |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 2 |
region
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 12 |
| Mean length | 8.4885699 |
| Min length | 4 |
Characters and Unicode
| Total characters | 95802 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Washington |
|---|---|
| 2nd row | Georgia |
| 3rd row | Missouri |
| 4th row | Missouri |
| 5th row | Tennessee |
| Value | Count | Frequency (%) |
| new | 1202 | 8.6% |
| dakota | 632 | 4.5% |
| south | 604 | 4.3% |
| california | 485 | 3.5% |
| north | 476 | 3.4% |
| carolina | 448 | 3.2% |
| texas | 438 | 3.1% |
| idaho | 436 | 3.1% |
| washington | 434 | 3.1% |
| pennsylvania | 418 | 3.0% |
| Other values (45) | 8419 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12588 | |
| o | 8731 | 9.1% |
| i | 8069 | 8.4% |
| n | 7458 | 7.8% |
| e | 6390 | 6.7% |
| s | 5876 | 6.1% |
| r | 5074 | 5.3% |
| t | 3915 | 4.1% |
| l | 3512 | 3.7% |
| h | 3195 | 3.3% |
| Other values (36) | 30994 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 79242 | |
| Uppercase Letter | 13854 | 14.5% |
| Space Separator | 2706 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12588 | |
| o | 8731 | |
| i | 8069 | |
| n | 7458 | |
| e | 6390 | |
| s | 5876 | |
| r | 5074 | 6.4% |
| t | 3915 | 4.9% |
| l | 3512 | 4.4% |
| h | 3195 | 4.0% |
| Other values (14) | 14434 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2194 | |
| M | 1544 | |
| C | 1401 | |
| I | 1249 | 9.0% |
| D | 859 | 6.2% |
| A | 835 | 6.0% |
| W | 709 | 5.1% |
| T | 657 | 4.7% |
| S | 604 | 4.4% |
| K | 556 | 4.0% |
| Other values (11) | 3246 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2706 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 93096 | |
| Common | 2706 | 2.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12588 | |
| o | 8731 | 9.4% |
| i | 8069 | 8.7% |
| n | 7458 | 8.0% |
| e | 6390 | 6.9% |
| s | 5876 | 6.3% |
| r | 5074 | 5.5% |
| t | 3915 | 4.2% |
| l | 3512 | 3.8% |
| h | 3195 | 3.4% |
| Other values (35) | 28288 |
Common
| Value | Count | Frequency (%) |
| (space) | 2706 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95802 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12588 | |
| o | 8731 | 9.1% |
| i | 8069 | 8.4% |
| n | 7458 | 7.8% |
| e | 6390 | 6.7% |
| s | 5876 | 6.1% |
| r | 5074 | 5.3% |
| t | 3915 | 4.1% |
| l | 3512 | 3.7% |
| h | 3195 | 3.3% |
| Other values (36) | 30994 |
total_hits
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 204 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.454014 |
| Minimum | 3 |
| Maximum | 500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 88.3 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 19 |
| median | 29 |
| Q3 | 45 |
| 95-th percentile | 89 |
| Maximum | 500 |
| Range | 497 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 34.417162 |
|---|---|
| Coefficient of variation (CV) | 0.91891785 |
| Kurtosis | 55.117436 |
| Mean | 37.454014 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 5.6419069 |
| Sum | 422706 |
| Variance | 1184.541 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 363 | 3.2% |
| 16 | 351 | 3.1% |
| 15 | 342 | 3.0% |
| 14 | 337 | 3.0% |
| 19 | 336 | 3.0% |
| 17 | 334 | 3.0% |
| 20 | 315 | 2.8% |
| 23 | 302 | 2.7% |
| 13 | 301 | 2.7% |
| 21 | 297 | 2.6% |
| Other values (194) | 8008 |
| Value | Count | Frequency (%) |
| 3 | 3 | < 0.1% |
| 4 | 8 | 0.1% |
| 5 | 12 | 0.1% |
| 6 | 8 | 0.1% |
| 7 | 18 | 0.2% |
| 8 | 53 | 0.5% |
| 9 | 91 | 0.8% |
| 10 | 113 | 1.0% |
| 11 | 185 | 1.6% |
| 12 | 245 | 2.2% |
| Value | Count | Frequency (%) |
| 500 | 12 | |
| 471 | 1 | < 0.1% |
| 387 | 6 | |
| 386 | 6 | |
| 385 | 1 | < 0.1% |
| 382 | 1 | < 0.1% |
| 361 | 1 | < 0.1% |
| 331 | 1 | < 0.1% |
| 328 | 1 | < 0.1% |
| 311 | 1 | < 0.1% |
total_pageviews
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 153 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.290537 |
| Minimum | 3 |
| Maximum | 466 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 88.3 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 16 |
| median | 23 |
| Q3 | 35 |
| 95-th percentile | 64 |
| Maximum | 466 |
| Range | 463 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 26.395584 |
|---|---|
| Coefficient of variation (CV) | 0.90116421 |
| Kurtosis | 104.4019 |
| Mean | 29.290537 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 7.8665297 |
| Sum | 330573 |
| Variance | 696.72683 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 462 | 4.1% |
| 15 | 453 | 4.0% |
| 14 | 452 | 4.0% |
| 13 | 437 | 3.9% |
| 18 | 436 | 3.9% |
| 17 | 431 | 3.8% |
| 21 | 396 | 3.5% |
| 20 | 388 | 3.4% |
| 12 | 370 | 3.3% |
| 22 | 357 | 3.2% |
| Other values (143) | 7104 |
| Value | Count | Frequency (%) |
| 3 | 3 | < 0.1% |
| 4 | 8 | 0.1% |
| 5 | 12 | 0.1% |
| 6 | 8 | 0.1% |
| 7 | 23 | 0.2% |
| 8 | 81 | 0.7% |
| 9 | 133 | 1.2% |
| 10 | 242 | 2.1% |
| 11 | 355 | 3.1% |
| 12 | 370 | 3.3% |
| Value | Count | Frequency (%) |
| 466 | 12 | |
| 343 | 6 | |
| 341 | 6 | |
| 305 | 1 | < 0.1% |
| 270 | 1 | < 0.1% |
| 233 | 1 | < 0.1% |
| 232 | 1 | < 0.1% |
| 224 | 1 | < 0.1% |
| 208 | 1 | < 0.1% |
| 202 | 1 | < 0.1% |
total_time_on_site
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 2716 |
|---|---|
| Distinct (%) | 24.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1077.0158 |
| Minimum | 9 |
| Maximum | 15047 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 88.3 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 226 |
| Q1 | 460 |
| median | 782 |
| Q3 | 1374 |
| 95-th percentile | 2846.75 |
| Maximum | 15047 |
| Range | 15038 |
| Interquartile range (IQR) | 914 |
Descriptive statistics
| Standard deviation | 964.72145 |
|---|---|
| Coefficient of variation (CV) | 0.89573568 |
| Kurtosis | 16.999072 |
| Mean | 1077.0158 |
| Median Absolute Deviation (MAD) | 394 |
| Skewness | 2.9828584 |
| Sum | 12155200 |
| Variance | 930687.47 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 611 | 19 | 0.2% |
| 439 | 18 | 0.2% |
| 360 | 18 | 0.2% |
| 352 | 18 | 0.2% |
| 356 | 18 | 0.2% |
| 307 | 17 | 0.2% |
| 830 | 17 | 0.2% |
| 388 | 17 | 0.2% |
| 500 | 17 | 0.2% |
| 568 | 17 | 0.2% |
| Other values (2706) | 11110 |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 56 | 2 | < 0.1% |
| 64 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 83 | 2 | < 0.1% |
| 95 | 1 | < 0.1% |
| 96 | 5 | |
| 97 | 3 |
| Value | Count | Frequency (%) |
| 15047 | 1 | < 0.1% |
| 12136 | 1 | < 0.1% |
| 11094 | 2 | < 0.1% |
| 9564 | 1 | < 0.1% |
| 9275 | 2 | < 0.1% |
| 8999 | 1 | < 0.1% |
| 8811 | 1 | < 0.1% |
| 8805 | 6 | |
| 8369 | 1 | < 0.1% |
| 7433 | 1 | < 0.1% |
visit_number
Real number (ℝ)
| Distinct | 109 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.3429027 |
| Minimum | 1 |
| Maximum | 315 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 88.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 12 |
| Maximum | 315 |
| Range | 314 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 14.239769 |
|---|---|
| Coefficient of variation (CV) | 3.2788598 |
| Kurtosis | 281.90641 |
| Mean | 4.3429027 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 15.333745 |
| Sum | 49014 |
| Variance | 202.77102 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 4330 | 38.4% |
| 2 | 2437 | 21.6% |
| 3 | 1393 | 12.3% |
| 4 | 876 | 7.8% |
| 5 | 549 | 4.9% |
| 6 | 345 | 3.1% |
| 7 | 243 | 2.2% |
| 8 | 179 | 1.6% |
| 9 | 155 | 1.4% |
| 10 | 129 | 1.1% |
| Other values (99) | 650 | 5.8% |
| Value | Count | Frequency (%) |
| 1 | 4330 | 38.4% |
| 2 | 2437 | 21.6% |
| 3 | 1393 | 12.3% |
| 4 | 876 | 7.8% |
| 5 | 549 | 4.9% |
| 6 | 345 | 3.1% |
| 7 | 243 | 2.2% |
| 8 | 179 | 1.6% |
| 9 | 155 | 1.4% |
| 10 | 129 | 1.1% |
| Value | Count | Frequency (%) |
| 315 | 1 | < 0.1% |
| 312 | 1 | < 0.1% |
| 305 | 1 | < 0.1% |
| 303 | 2 | |
| 300 | 1 | < 0.1% |
| 299 | 3 | |
| 296 | 1 | < 0.1% |
| 295 | 1 | < 0.1% |
| 293 | 1 | < 0.1% |
| 259 | 1 | < 0.1% |
kmeans_cluster
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| 0 | 9818 |
| 3 | 814 |
| 1 | 494 |
| 2 | 160 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11286 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 9818 | 87.0% |
| 3 | 814 | 7.2% |
| 1 | 494 | 4.4% |
| 2 | 160 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9818 | 87.0% |
| 3 | 814 | 7.2% |
| 1 | 494 | 4.4% |
| 2 | 160 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11286 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9818 | 87.0% |
| 3 | 814 | 7.2% |
| 1 | 494 | 4.4% |
| 2 | 160 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11286 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9818 | 87.0% |
| 3 | 814 | 7.2% |
| 1 | 494 | 4.4% |
| 2 | 160 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11286 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9818 | 87.0% |
| 3 | 814 | 7.2% |
| 1 | 494 | 4.4% |
| 2 | 160 | 1.4% |
dbscan_cluster
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| -1 | 10430 |
|---|---|
| 0 | 349 |
| 1 | 266 |
| 2 | 241 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.9241538 |
| Min length | 1 |
Characters and Unicode
| Total characters | 21716 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
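The length and character figures for dbscan_cluster can be rebuilt from the value counts alone, because the label "-1" contributes two characters per row. The sketch below is an illustration using only the Python standard library and the four counts reported for this variable; it is not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# Value counts as reported for dbscan_cluster (labels treated as strings).
value_counts = {"-1": 10430, "0": 349, "1": 266, "2": 241}

n_rows = sum(value_counts.values())
total_chars = sum(len(v) * c for v, c in value_counts.items())
mean_length = total_chars / n_rows

char_counts = Counter()
category_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count
        # "Nd" = Decimal Number, "Pd" = Dash Punctuation
        category_counts[unicodedata.category(ch)] += count

print(total_chars, round(mean_length, 7))
print(char_counts.most_common())
print(dict(category_counts))
```

The character "1" is counted both in "-1" and in "1", which is why it appears 10696 times even though the cluster label 1 occurs in only 266 rows.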
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | -1 |
|---|---|
| 2nd row | -1 |
| 3rd row | -1 |
| 4th row | -1 |
| 5th row | -1 |
Common Values
| Value | Count | Frequency (%) |
| -1 | 10430 | 92.4% |
| 0 | 349 | 3.1% |
| 1 | 266 | 2.4% |
| 2 | 241 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 10696 | 49.3% |
| - | 10430 | 48.0% |
| 0 | 349 | 1.6% |
| 2 | 241 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11286 | 52.0% |
| Dash Punctuation | 10430 | 48.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10696 | 94.8% |
| 0 | 349 | 3.1% |
| 2 | 241 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10430 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21716 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 10696 | 49.3% |
| - | 10430 | 48.0% |
| 0 | 349 | 1.6% |
| 2 | 241 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21716 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 10696 | 49.3% |
| - | 10430 | 48.0% |
| 0 | 349 | 1.6% |
| 2 | 241 | 1.1% |
agg_cluster
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| 2 | 9078 |
|---|---|
| 0 | 1240 |
| 1 | 814 |
| 3 | 154 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11286 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 9078 | 80.4% |
| 0 | 1240 | 11.0% |
| 1 | 814 | 7.2% |
| 3 | 154 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 9078 | 80.4% |
| 0 | 1240 | 11.0% |
| 1 | 814 | 7.2% |
| 3 | 154 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11286 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 9078 | 80.4% |
| 0 | 1240 | 11.0% |
| 1 | 814 | 7.2% |
| 3 | 154 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11286 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 9078 | 80.4% |
| 0 | 1240 | 11.0% |
| 1 | 814 | 7.2% |
| 3 | 154 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11286 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 9078 | 80.4% |
| 0 | 1240 | 11.0% |
| 1 | 814 | 7.2% |
| 3 | 154 | 1.4% |
device_category
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 88.3 KiB |
| desktop | 10312 |
|---|---|
| mobile | 814 |
| tablet | 160 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.9136984 |
| Min length | 6 |
Characters and Unicode
| Total characters | 78028 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | desktop |
|---|---|
| 2nd row | desktop |
| 3rd row | desktop |
| 4th row | desktop |
| 5th row | desktop |
Common Values
| Value | Count | Frequency (%) |
| desktop | 10312 | 91.4% |
| mobile | 814 | 7.2% |
| tablet | 160 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 11286 | 14.5% |
| o | 11126 | 14.3% |
| t | 10632 | 13.6% |
| d | 10312 | 13.2% |
| s | 10312 | 13.2% |
| k | 10312 | 13.2% |
| p | 10312 | 13.2% |
| b | 974 | 1.2% |
| l | 974 | 1.2% |
| m | 814 | 1.0% |
| Other values (2) | 974 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 78028 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11286 | 14.5% |
| o | 11126 | 14.3% |
| t | 10632 | 13.6% |
| d | 10312 | 13.2% |
| s | 10312 | 13.2% |
| k | 10312 | 13.2% |
| p | 10312 | 13.2% |
| b | 974 | 1.2% |
| l | 974 | 1.2% |
| m | 814 | 1.0% |
| Other values (2) | 974 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 78028 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11286 | 14.5% |
| o | 11126 | 14.3% |
| t | 10632 | 13.6% |
| d | 10312 | 13.2% |
| s | 10312 | 13.2% |
| k | 10312 | 13.2% |
| p | 10312 | 13.2% |
| b | 974 | 1.2% |
| l | 974 | 1.2% |
| m | 814 | 1.0% |
| Other values (2) | 974 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78028 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 11286 | 14.5% |
| o | 11126 | 14.3% |
| t | 10632 | 13.6% |
| d | 10312 | 13.2% |
| s | 10312 | 13.2% |
| k | 10312 | 13.2% |
| p | 10312 | 13.2% |
| b | 974 | 1.2% |
| l | 974 | 1.2% |
| m | 814 | 1.0% |
| Other values (2) | 974 | 1.2% |
products
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0565302 |
| Minimum | 1 |
|---|---|
| Maximum | 35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 88.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 9 |
| Maximum | 35 |
| Range | 34 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.9844663 |
|---|---|
| Coefficient of variation (CV) | 0.97642295 |
| Kurtosis | 13.00007 |
| Mean | 3.0565302 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.9327305 |
| Sum | 34496 |
| Variance | 8.9070389 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 1 | 4203 | 37.2% |
| 2 | 2438 | 21.6% |
| 3 | 1458 | 12.9% |
| 4 | 1019 | 9.0% |
| 5 | 641 | 5.7% |
| 6 | 444 | 3.9% |
| 7 | 278 | 2.5% |
| 8 | 183 | 1.6% |
| 9 | 153 | 1.4% |
| 10 | 127 | 1.1% |
| Other values (21) | 342 | 3.0% |
Extreme values (minimum 10 values)
| Value | Count | Frequency (%) |
| 1 | 4203 | 37.2% |
| 2 | 2438 | 21.6% |
| 3 | 1458 | 12.9% |
| 4 | 1019 | 9.0% |
| 5 | 641 | 5.7% |
| 6 | 444 | 3.9% |
| 7 | 278 | 2.5% |
| 8 | 183 | 1.6% |
| 9 | 153 | 1.4% |
| 10 | 127 | 1.1% |
Extreme values (maximum 10 values)
| Value | Count | Frequency (%) |
| 35 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 3 | < 0.1% |
| 26 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 4 | < 0.1% |
| 23 | 4 | < 0.1% |
| 22 | 15 | 0.1% |
Correlations
| | visitor_id | total_transaction_revenue | total_hits | total_pageviews | total_time_on_site | visit_number | products | channel_grouping | browser | traffic_source | kmeans_cluster | dbscan_cluster | agg_cluster | device_category |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| visitor_id | 1.000 | 0.004 | 0.014 | 0.018 | 0.025 | -0.008 | 0.011 | 0.031 | 0.023 | 0.034 | 0.017 | 0.000 | 0.018 | 0.020 |
| total_transaction_revenue | 0.004 | 1.000 | 0.302 | 0.276 | 0.202 | 0.223 | 0.517 | 0.061 | 0.057 | 0.035 | 0.035 | 0.000 | 0.031 | 0.000 |
| total_hits | 0.014 | 0.302 | 1.000 | 0.981 | 0.661 | -0.009 | 0.561 | 0.064 | 0.018 | 0.158 | 0.451 | 0.073 | 0.377 | 0.089 |
| total_pageviews | 0.018 | 0.276 | 0.981 | 1.000 | 0.691 | -0.030 | 0.529 | 0.072 | 0.000 | 0.178 | 0.405 | 0.050 | 0.365 | 0.096 |
| total_time_on_site | 0.025 | 0.202 | 0.661 | 0.691 | 1.000 | -0.031 | 0.345 | 0.080 | 0.045 | 0.203 | 0.378 | 0.081 | 0.324 | 0.109 |
| visit_number | -0.008 | 0.223 | -0.009 | -0.030 | -0.031 | 1.000 | 0.127 | 0.143 | 0.135 | 0.130 | 0.027 | 0.000 | 0.024 | 0.000 |
| products | 0.011 | 0.517 | 0.561 | 0.529 | 0.345 | 0.127 | 1.000 | 0.026 | 0.009 | 0.000 | 0.184 | 0.046 | 0.201 | 0.044 |
| channel_grouping | 0.031 | 0.061 | 0.064 | 0.072 | 0.080 | 0.143 | 0.026 | 1.000 | 0.138 | 0.693 | 0.176 | 0.031 | 0.173 | 0.213 |
| browser | 0.023 | 0.057 | 0.018 | 0.000 | 0.045 | 0.135 | 0.009 | 0.138 | 1.000 | 0.163 | 0.274 | 0.018 | 0.276 | 0.335 |
| traffic_source | 0.034 | 0.035 | 0.158 | 0.178 | 0.203 | 0.130 | 0.000 | 0.693 | 0.163 | 1.000 | 0.138 | 0.003 | 0.109 | 0.171 |
| kmeans_cluster | 0.017 | 0.035 | 0.451 | 0.405 | 0.378 | 0.027 | 0.184 | 0.176 | 0.274 | 0.138 | 1.000 | 0.062 | 0.880 | 1.000 |
| dbscan_cluster | 0.000 | 0.000 | 0.073 | 0.050 | 0.081 | 0.000 | 0.046 | 0.031 | 0.018 | 0.003 | 0.062 | 1.000 | 0.080 | 0.060 |
| agg_cluster | 0.018 | 0.031 | 0.377 | 0.365 | 0.324 | 0.024 | 0.201 | 0.173 | 0.276 | 0.109 | 0.880 | 0.080 | 1.000 | 0.990 |
| device_category | 0.020 | 0.000 | 0.089 | 0.096 | 0.109 | 0.000 | 0.044 | 0.213 | 0.335 | 0.171 | 1.000 | 0.060 | 0.990 | 1.000 |
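One way to read the matrix is to list the pairs whose coefficient reaches a cut-off. The sketch below does this for five of the numeric columns, with the upper-triangle values copied from the table above. The 0.5 threshold is an assumption: it happens to reproduce the High correlation alerts in the report overview for these columns, but the report does not state the cut-off it used.

```python
# Upper-triangle entries for five numeric columns, copied from the matrix above.
corr = {
    ("total_transaction_revenue", "total_hits"): 0.302,
    ("total_transaction_revenue", "total_pageviews"): 0.276,
    ("total_transaction_revenue", "total_time_on_site"): 0.202,
    ("total_transaction_revenue", "products"): 0.517,
    ("total_hits", "total_pageviews"): 0.981,
    ("total_hits", "total_time_on_site"): 0.661,
    ("total_hits", "products"): 0.561,
    ("total_pageviews", "total_time_on_site"): 0.691,
    ("total_pageviews", "products"): 0.529,
    ("total_time_on_site", "products"): 0.345,
}

THRESHOLD = 0.5  # assumed cut-off, not stated in the report

# Keep pairs at or above the threshold, strongest first.
high = sorted(
    ((a, b, r) for (a, b), r in corr.items() if abs(r) >= THRESHOLD),
    key=lambda item: -abs(item[2]),
)
for a, b, r in high:
    print(f"{a} ~ {b}: {r:.3f}")
```

The near-unit coefficient between total_hits and total_pageviews suggests the two columns carry largely the same information.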
First rows
| | visitor_id | visit_date | total_transaction_revenue | channel_grouping | browser | traffic_source | country | cities | region | total_hits | total_pageviews | total_time_on_site | visit_number | kmeans_cluster | dbscan_cluster | agg_cluster | device_category | products |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 213131142648941 | 2017-04-28 | 39590000 | Direct | Chrome | (direct) | United States | Spokane | Washington | 14 | 13 | 272 | 1 | 0 | -1 | 2 | desktop | 1 |
| 1 | 435324061339869 | 2016-10-21 | 46790000 | Referral | Chrome | (direct) | United States | Pooler | Georgia | 14 | 11 | 627 | 2 | 0 | -1 | 2 | desktop | 1 |
| 2 | 562678147042735 | 2017-04-24 | 62610000 | Referral | Chrome | (direct) | United States | Chesterfield | Missouri | 18 | 16 | 319 | 2 | 0 | -1 | 2 | desktop | 1 |
| 3 | 562678147042735 | 2017-04-24 | 97700000 | Referral | Chrome | (direct) | United States | Chesterfield | Missouri | 18 | 16 | 319 | 2 | 0 | -1 | 2 | desktop | 1 |
| 4 | 585708896049892 | 2016-12-21 | 45970000 | Referral | Chrome | (direct) | United States | Murfreesboro | Tennessee | 22 | 20 | 634 | 1 | 0 | -1 | 2 | desktop | 3 |
| 5 | 852801263780322 | 2017-06-27 | 80000000 | Direct | Chrome | (direct) | United States | Morris Heights | New York | 28 | 22 | 675 | 1 | 0 | 0 | 2 | desktop | 3 |
| 6 | 1123528056036404 | 2016-12-12 | 98960000 | Direct | Chrome | (direct) | United States | Portland | Texas | 35 | 31 | 677 | 1 | 0 | -1 | 2 | desktop | 4 |
| 7 | 1905118576359487 | 2016-10-17 | 22190000 | Referral | Chrome | (direct) | United States | Winston-Salem | North Carolina | 26 | 25 | 1588 | 5 | 0 | -1 | 2 | desktop | 2 |
| 8 | 2527528149176601 | 2016-12-02 | 17990000 | Organic Search | Safari | (direct) | United States | Lowell | Massachusetts | 15 | 13 | 348 | 1 | 3 | -1 | 1 | mobile | 1 |
| 9 | 2709834583138581 | 2016-12-16 | 9990000 | Direct | Firefox | (direct) | United States | Casper | Wyoming | 15 | 15 | 614 | 1 | 0 | -1 | 2 | desktop | 1 |
Last rows
| | visitor_id | visit_date | total_transaction_revenue | channel_grouping | browser | traffic_source | country | cities | region | total_hits | total_pageviews | total_time_on_site | visit_number | kmeans_cluster | dbscan_cluster | agg_cluster | device_category | products |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11276 | 9989256027389984768 | 2016-10-30 | 68980000 | Organic Search | Chrome | | United States | Altadena | California | 36 | 26 | 1262 | 2 | 3 | -1 | 1 | mobile | 3 |
| 11277 | 9989795984216870912 | 2017-02-06 | 210720000 | Referral | Chrome | (direct) | United States | Maryvale | Arizona | 74 | 54 | 1101 | 6 | 0 | -1 | 2 | desktop | 6 |
| 11278 | 9990183617359421440 | 2017-03-30 | 26380000 | Organic Search | Chrome | | United States | Longview | Washington | 20 | 16 | 635 | 1 | 0 | -1 | 2 | desktop | 2 |
| 11279 | 9990183617359421440 | 2017-04-27 | 133120000 | Organic Search | Chrome | | United States | Longview | Washington | 25 | 20 | 1814 | 6 | 0 | -1 | 2 | desktop | 3 |
| 11280 | 9990797196896346112 | 2017-04-21 | 47250000 | Organic Search | Chrome | | United States | Fountain Hills | Arizona | 40 | 26 | 756 | 1 | 0 | -1 | 2 | desktop | 3 |
| 11281 | 9991633376050114560 | 2017-02-18 | 35590000 | Social | Chrome | plus.google.com | United States | Newton | Kansas | 17 | 16 | 386 | 1 | 0 | -1 | 2 | desktop | 1 |
| 11282 | 9994767073213036544 | 2016-08-09 | 140320000 | Organic Search | Chrome | | United States | Omaha | Nebraska | 42 | 30 | 755 | 6 | 0 | -1 | 2 | desktop | 6 |
| 11283 | 9997409246962677760 | 2016-12-09 | 40360000 | Referral | Chrome | (direct) | United States | Lafayette | Indiana | 86 | 65 | 1423 | 2 | 0 | -1 | 0 | desktop | 5 |
| 11284 | 9998597322098587648 | 2016-08-01 | 102200000 | Direct | Chrome | (direct) | United States | Newburg | Kentucky | 37 | 33 | 2041 | 1 | 0 | -1 | 2 | desktop | 2 |
| 11285 | 9998996003043229696 | 2016-11-17 | 66980000 | Organic Search | Chrome | (direct) | United States | Racine | Wisconsin | 16 | 16 | 412 | 1 | 0 | -1 | 2 | desktop | 2 |